Four Massachusetts-type (Mass-type) strains of infectious bronchitis coronavirus (IBV) were compared 
genetically with the pathogenic M41 and H120 vaccine strains using the complete genomic sequences. 
The results revealed that strains ck/CH/LNM/091017 and ck/CH/LDL/101212 were closely related to the 
H120 vaccine, which suggests that they might represent re-isolations of vaccine strains or variants of vaccine 
strains that have resulted from accumulated point mutations after several passages in chickens. 
In contrast, strains ck/CH/LHLJ/07VII and ck/CH/LHLJ/100902 had a close genetic relationship with the 
pathogenic M41 strain. In addition, molecular markers were identified that distinguish 
field from vaccine (or vaccine-like) Mass-type viruses and may therefore be useful 
for diagnostic purposes. Phylogenetic analysis, and pairwise comparison of full-length 
genomes and the nine genes, identified the occurrence of recombination events in the genome of strain 
CK/CH/LHLJ/07VII, which suggests that this virus originated from recombination events between 
M41- and H120-like strains at a switch site located at the 3’ end of the nucleocapsid (N) gene. To 
our knowledge, this is the first time that evidence for the evolution and natural recombination under field 
conditions between Mass-type pathogenic and vaccinal IBV strains has been documented. These findings 
provide insights into the emergence and evolution of the Mass-type IB coronaviruses and may help to 
explain the emergence of Mass-type IBV in chicken flocks all over the world. 


© 2012 Elsevier B.V. All rights reserved. 


1. Introduction 


In 1931, Schalk and Hawn described “an apparently new respi- 
ratory disease of chicks” in North Dakota in the United States, 
which was considered to be infectious bronchitis (IB) by later 
researchers of avian respiratory diseases (Schalk and Hawn, 
1931). Currently, IB still occurs in nearly all poultry-producing 
countries; it is a highly contagious, acute, and economically impor- 
tant viral disease of chickens. The etiology of IB, which was first 
demonstrated by Beach and Schalm (1936), is infectious bronchitis 
virus (IBV). 

IBV is grouped in the genus Gammacoronavirus of the family 
Coronaviridae in the order Nidovirales (de Groot et al., 2011). The 
coronavirus genomes are the largest among the known RNA 
viruses and are polycistronic, generating a nested set of subgenom- 
ic RNAs with common 5’ and 3’ sequences (Masters, 2006). Like 
those of all other coronaviruses, the 5’ two-thirds of the IBV 
genome consists of two large replicase open reading frames (ORFs), 
ORF1a and ORF1b. The ORF1a polyprotein (pp1a) can be extended 
with ORF1b-encoded sequences via a −1 ribosomal frameshift at a 
conserved slippery site (Brierley et al., 1987), generating the 
polyprotein pp1ab, which comprises more than 7000 amino acids and 
includes the putative RNA-dependent RNA polymerase (RdRp) and 
RNA helicase (HEL) activities (Ziebuhr et al., 2001). The pp1a and 
pp1ab of IBV are processed autocatalytically by two different viral 
proteases, a papain-like protease (PLP) and a 3C-like 
protease (3CLpro) (Lee et al., 1991; Ziebuhr et al., 2000, 2001). Other 
putative domains, presumably associated with a 3’-to-5’ exonuclease 
(ExoN) activity, a poly(U)-specific endo-RNase (XendoU) activity, 
and a 2’-O-methyltransferase (2’-O-MT) activity, have been 
predicted in pp1ab (Ivanov et al., 2004; Snijder et al., 2003). The 
3’ end of a coronavirus genome includes the viral structural and 
accessory protein genes: a spike (S) glycoprotein gene; an envelope 
(E) protein gene; a membrane (M) glycoprotein gene; a nucleocap- 
sid (N) phosphoprotein gene; and several ORFs that encode puta- 
tive non-structural accessory proteins (Masters, 2006). Of the 
virus-encoded proteins, the S1 subunit of the S protein carries 
virus-neutralizing activity, determines the serotype of IBV and is 
responsible for viral attachment to cells. It is also a major determinant 
of cell tropism in culture (Casais et al., 2003). The accumulation 
of point mutations, deletions, insertions and recombination 
events observed in multiple structural genes, especially 
the S1 gene, of IBV recovered from naturally occurring infections 
is considered to contribute to the genetic diversity 
and evolution of IBV and, consequently, to the number of IBV serotypes 
(Cavanagh, 2007). 

The occurrence and emergence of multiple serotypes of the 
virus have complicated control by vaccination because many sero- 
types and variants do not confer complete cross-protection against 
each other (Cavanagh and Gelb, 2008). The originally discovered 
Massachusetts (Mass) type of IBV was identified in the United 
States, beginning in the 1950s (Fabricant, 2000; Johnson and 
Marquardt, 1975; Mondal et al., 2001). Mass-type strains have 
been isolated in Europe and Asia since the 1950s and up to the 
present day (Cavanagh and Gelb, 2008), together with dozens of 
other serotypes that have been isolated in Africa, Asia, India, 
Australia, Europe, and South America (Cavanagh, 2001, 2003, 
2005). The first Mass-type “H” vaccines were developed in about 
1960. They include H120 and H52 (Bijlenga et al., 2004), and are 
used very commonly and widely around the world. However, virus 
of this type is occasionally isolated from Massachusetts-vaccinated 
and -unvaccinated flocks with respiratory clinical signs. Some of 
the viruses have shown close genetic relationships with pathogenic 
Mass-type, rather than vaccine, strains by S1 gene analysis. 
However, conclusions based on the genetic analysis of a single 
gene sequence, and sometimes even a partial gene sequence, 
require caution because the true phylogeny can only be demon- 
strated by analyzing complete genomic sequences. Herein, we 
sequenced the complete genome of four IBV Mass-type strains that 
showed S1 gene diversity (Liu et al., 2009; Ma et al., 2012; Sun 
et al., 2011), and we present evidence for in-field recombination 
between pathogenic and vaccinal strains. Furthermore, we charac- 
terized the molecular variability of the four Mass-type strains to 
gain insight into the emergence and evolution of these viruses. 


2. Materials and methods 
2.1. Virus strains 


Four Mass-type IBV strains were used for complete genomic sequence 
comparison and analysis in this study. Strain ck/CH/LHLJ/07VII 
was isolated in 2007 from the kidney of a layer hen vaccinated 
with H120 and 4/91 in Heilongjiang province, China (Liu 
et al., 2009). Strain ck/CH/LNM/091017 was isolated in 2009 from 
the swollen proventricular tissues of a broiler vaccinated with 
H120 in Neimenggu province, China (Sun et al., 2011). Strains ck/ 
CH/LDL/101212 and ck/CH/LHLJ/100902, both of which were iso- 
lated in 2010, were obtained from laying hens in Dalian and Hei- 
longjiang provinces, respectively, in China; the birds were 
suffering from nephropathogenic lesions and respiratory signs, 
respectively. In addition, the diseased birds in both flocks were suf- 
fering from proventriculitis (Ma et al., 2012). All of the IBV strains 
have been associated with various IB outbreaks in recent years in 
China and were assigned to the Mass-type strains by S1 sequence 
analysis. To avoid possible mutations in the viral genome after 
serial passages in specific-pathogen-free (SPF) embryonated chicken 
eggs, the first passage of each original virus stock was used and 
purified once by propagation in 9- to 11-day-old SPF chicken eggs 
at a 10⁴-fold dilution per egg, and the presence of viral 
particles in the allantoic fluids of inoculated eggs was confirmed 
by negative-contrast electron microscopy (JEM-1200 EX) 
and reverse transcriptase-polymerase chain reaction (RT-PCR) as 
described previously (Han et al., 2011). In addition, because these 
viruses were isolated from chickens vaccinated with H120, it is 
possible that mixed IBV infections were present in a single chicken flock. 
To exclude this, nine clones of the S1 gene of each virus, obtained from 
three independent PCR reactions, were sequenced and compared. 
For each virus, the sequences obtained were identical to the previous 
results (Liu et al., 2009; Ma et al., 2012; Sun et al., 2011). 


2.2. Eggs 


Fertile White Leghorn SPF chicken eggs were obtained from the 
Laboratory Animal Center, Harbin Veterinary Research Institute, 
the Chinese Academy of Agricultural Sciences, China. 


2.3. RNA extraction and RT-PCR 


To determine the full-length genomic sequences of the four 
viruses, 15 pairs of overlapping primers encompassing the entire 
genome were used. The primers were designed in regions that 
are conserved among most of the IBV strains available in the Gen- 
Bank database. The sequences and locations of the primers used in 
RT and PCR in this study are presented in Table 1. 
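
The tiling strategy described above can be sanity-checked from the primer coordinates in Table 1. The following is a minimal sketch, not part of the original workflow: it assumes that each positive-sense primer is paired with the negative-sense primer listed directly below it in Table 1, and the names `AMPLICONS` and `overlaps` are hypothetical.

```python
# Amplicon extents as (label, forward-primer start, reverse-primer end), using the
# Beaudette-based coordinates from Table 1. The forward/reverse pairing is an
# assumption inferred from the row order of the table.
AMPLICONS = [
    ("IBV-366/367", 640, 1911),
    ("IBV-368/369", 1828, 3839),
    ("IBV-370/371", 3685, 6431),
    ("IBV-372/373", 6353, 9077),
    ("IBV-377/378", 8971, 10711),
    ("IBV-379/380", 10588, 12122),
    ("IBV-381/382", 12011, 14860),
    ("IBV-383/384", 14789, 17436),
    ("IBV-375/376", 17346, 18932),
    ("IBV-374/105-1", 18780, 20412),
    ("IBV-257/275", 20356, 22165),
    ("IBV-167/168", 21921, 22973),
    ("IBV-182/281", 22921, 24976),
    ("IBV-280/171", 24846, 25979),
    ("N(+)/N(-)", 25903, 27507),
]


def overlaps(amplicons):
    """Return the overlap (in nucleotides) between each consecutive amplicon pair."""
    return [
        prev_end - next_start + 1
        for (_, _, prev_end), (_, next_start, _) in zip(amplicons, amplicons[1:])
    ]


if __name__ == "__main__":
    ov = overlaps(AMPLICONS)
    print(f"{len(AMPLICONS)} amplicons, smallest overlap {min(ov)} nt")
    print(f"span covered: {AMPLICONS[0][1]}-{AMPLICONS[-1][2]}")
```

Under these assumptions every consecutive pair of amplicons overlaps, and the regions outside the covered span correspond to the genome termini handled by RACE in Section 2.4.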

Viral RNA was extracted from 200 µl of infectious allantoic fluid 
using TRIzol reagent (Invitrogen, Grand Island, USA), following the 
manufacturer’s protocol. Complementary DNA (cDNA) was synthesized 
using 80 µl of the first strand mixture (Invitrogen) containing 
20 µM of primer N(-), 0.5 mM of each dNTP (TaKaRa, Dalian, China) 
and 40 µl of total RNA. The mixture was incubated at 70 °C for 
5 min and then quick-chilled on ice for 2 min. The RT master mix 
was composed of 16 µl of 5× RT buffer (Invitrogen), 8 µl of 10 mM 
DTT, 200 U of M-MLV Reverse Transcriptase (Invitrogen), and 
20 U of RNase inhibitor (Invitrogen). The RT reaction was incubated 
at 37 °C for 3 h and was terminated by heating at 
70 °C for 10 min, then chilling on ice for 5 min. 

The PCR was performed in a 25 µl reaction containing 2 µl of first 
strand cDNA; 15 nmol each of the downstream and upstream primers; 
5 µl of 10× PCR buffer (Mg²⁺ Plus, TaKaRa); 4 µl of 2.5 mmol 
dNTPs; 2 U of Taq polymerase (TaKaRa); and 18 µl of water. The reaction 
was conducted at 95 °C for 5 min, followed by 30 cycles of 94 °C for 
1 min, 50 °C for 1 min, and 72 °C for 2 min, and a final extension step 
of 72 °C for 10 min. A product of the expected size, detectable by 
ethidium bromide staining, was generated. 
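
For run planning, the cycling conditions above can be written as a small data structure. This is only an illustrative sketch: the constant and function names are hypothetical, and the total counts hold times only, ignoring the ramping time between temperatures.

```python
# Cycling conditions as stated in the text: (temperature in degrees C, minutes).
INITIAL_DENATURATION = (95, 5)
CYCLE_STEPS = [(94, 1), (50, 1), (72, 2)]  # denaturation, annealing, extension
N_CYCLES = 30
FINAL_EXTENSION = (72, 10)


def total_hold_minutes():
    """Sum the programmed hold times of the whole PCR run (ramping excluded)."""
    per_cycle = sum(minutes for _, minutes in CYCLE_STEPS)
    return INITIAL_DENATURATION[1] + N_CYCLES * per_cycle + FINAL_EXTENSION[1]


if __name__ == "__main__":
    print(total_hold_minutes())  # 135
```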


2.4. The 5’- and 3’-ends of the genome 


cDNA representing the 5’ and 3’ ends of the genome of each of 
the four IBV strains was synthesized using the 5’ RACE and 
3’ RACE System for rapid amplification of cDNA ends (TaKaRa). PCR 
was performed according to the instructions accompanying the 
kits. The sense and antisense primers used to amplify the 5’- 
and 3’-ends of the genome were designed on the basis of the 
sequences obtained above that were conserved among the four IBV 
strains. The outer and inner primers used to amplify 
the 5’-end of the four IBV strains were 5’-CAGCTATGGCAATGCGCAG-3’ 
and 5’-CATCTTTGGTGTCTCA/TCC-3’, respectively. The primer 
used to amplify the 3’-end was 5’-GAGGAGAGGAACAATGCACA-3’. 


2.5. DNA cloning and sequence determination 


The DNA generated by PCR amplification was cloned using a T- 
tailed vector, pMD18-T (TaKaRa), and transformed using JM109 
competent cells (TaKaRa) according to the manufacturer’s instruc- 
tions. At least five clones of each fragment in each strain were se- 
quenced and the consensus sequence was determined. The 
sequences were analyzed using the Sequencher 4.5 sequence 
Table 1 
Oligonucleotide primers used for IBV genome amplification and their positions in the genome. 

Primer        Sense(a)  Sequence (5’ to 3’)               Position in genome(b) 
IBV-366       +         AACCCAAAAGATTACGCTGATGCTT         640-664 
IBV-367       -         CCAGAATTCAAAAGACTTTICAGTA         1887-1911 
IBV-368       +         GATGTCTTGAAGCTGTTTCAATC           1828-1850 
IBV-369       -         AAGAATGTGCTTGTAGATGGCGT           3817-3839 
IBV-370       +         TATGTTAAGAAACATGGGCCAC            3685-3706 
IBV-371       -         TACATTTATGTCTCCTGAAGTTIGCTGG      6405-6431 
IBV-372       +         GTAAGAGACATAATTIGGTATIG           6353-6374 
IBV-373       -         GGTGCAGTTTTAATTITTACAAACTGC       9052-9077 
IBV-377       +         CTTAATCTTGCTAATAATCATGAG          8971-8994 
IBV-378       -         GTGTACAACTGTTGTTTTAATGCAGC        10,686-10,711 
IBV-379       +         GTGTGGGAAGTCTTTTCGACAAATATAC      10,588-10,615 
IBV-380       -         GGACGTTGTCAATTTAAAGGTTC           12,100-12,122 
IBV-381       +         CTCCTGATCAGGATTCTTATGGAGGAGC      12,011-12,038 
IBV-382       -         CCTGTGGCTGTTATGGAGCGTTATAT        14,835-14,860 
IBV-383       +         CATTTTGGGTGCATGTGTTTTTGTAGATG     14,789-14,817 
IBV-384       -         GACACCATGTGACAAACACTACACAATCTG    17,407-17,436 
IBV-375       +         CCTTGGCATGTTATAAGACCAAGGATAG      17,346-17,373 
IBV-376       -         CGCTCCATCTTTTAATGGCATACCATICTG    18,903-18,932 
IBV-374       +         GTTCTGTATGATGATAGATATGGTGATTACC   18,780-18,810 
IBV-105-1     -         CCAAAGTGCCTTTAGACCGGCTG           20,388-20,412 
IBV-257       +         TATTGATTAGAGATGTGG                20,356-20,373 
IBV-275       -         GTTACAGATGAGTACATAC               22,147-22,165 
IBV-167       +         GCTTCTTGAGAA(T/C)CAGTTTTA         21,921-21,941 
IBV-168       -         AGACGATCAACTTGTGCATCTG            22,952-22,973 
IBV-182       +         GACATTTAC(C/G)(A/C)GCAACTTGA      22,921-22,940 
IBV-281       -         GCCACTGACC(C/A)TCACAATAAAG        24,955-24,976 
IBV-280       +         CCC(C/A)GAATCTAATGCCGTAGG         24,846-24,866 
IBV-171       -         AACCAAGATGCATTTCCAGA              25,960-25,979 
N(+)          +         GACGCCCCAGCGCCAGTCATTAAA          25,903-25,926 
N(-)(c)       -         ACGCGGAGTACGATCGAGGGTACA          27,484-27,507 

(a) Negative-sense (-) or positive-sense (+) primer. 
(b) Nucleotide positions correspond to the sequence of the IBV Beaudette genome, GenBank accession number M95169. 
(c) Primer N(-) was used for RT. 


Table 2 
IBV and TCoV strains used for genome sequence comparison in the present study. 

Virus                    GenBank accession No.   Virus                    GenBank accession No. 
A2                       EU526388                CK/CH/LDL/971            AF345267 
SAIBK                    DQ288927                Gray                     GU393334 
SC021202                 EU714029                FL18288                  GU393333 
BJ                       AY319651                Conn46 1991              FJ904719 
CK/CH/LSD/05I            EU637854                DY07                     HM245923 
Partridge/GD/S14/2003    AY646283                GX-YL 5                  HQ848267 
Peafowl/GD/KQ6/2003      AY641576                CQ04-1                   HM245924 
ZJ971                    EU714028                GX-YL 9                  HQ850618 
TW2575/98                DQ646405                JMK                      GU393338 
H52                      EU817497                Cal 1995                 FJ904714 
H120                     FJ888351                Georgia 1998 Vaccine     GQ504723 
Mass 41                  AY851295                Mass41 1965              FJ904720 
California 99            AY514485                Delaware 072             GU393332 
Beaudette                NC_001451               Arkansas DPI             GQ504720 
ArkDPI11                 EU418976                ck/CH/LZJ/111113         JX195176 
Iowa 97                  GU393337                Sczy3                    JF732903 
TCoV/MG10                NC_010800               Holte                    GU393336 
TCoV/TX/1038/98          GQ427176                ck/CH/LDL/091022         JX195175 
TCoV/IN/517/94           GQ427175                YN                       JF893452 
TCoV/TX/GL/01            GQ427174                Cal56b                   GU393331 
TCoV/VA/74/03            GQ427173                ck/CH/LHB/100801         JF330898 
TCoV/540                 EU022525                ITA/90254/2005           FN430414 
TCoV/ATCC                EU022526                NGA/A116E7/2006          FN430415 
analysis program, and a single contiguous sequence comprising the 
entire IBV genome of each of the four IBV strains was constructed. 


2.6. Phylogenetic relationships and sequence comparison 


The nucleotide and amino acid sequences of the entire genome 
of the four IBV strains were assembled, aligned, and compared with 
those of other reference IBV and Turkey coronavirus (TCoV) strains 
using the MEGALIGN program in DNAStar (version 7, Lasergene 
Corp, Madison, WI). The ORFs were determined using the Gene 
Runner program version 3.00 (http://www.generunner.com) by 
comparison with those of other reference IBV and TCoV strains. A 
total of 39 IBV and 7 TCoV reference strains, for which entire genomic 
sequences were available in the GenBank database, were selected 
Fig. 1. Genome organization of four Mass-type IBV isolates. Diagrammatic representation of the genome organization shows the predicted genes and their relative sizes and 
positions along the IBV genome. S, spike glycoprotein gene; 3a, 3b, and 3c (E), tricistronic gene 3; M, membrane protein gene; 5a and 5b, bicistronic gene 5; N, nucleocapsid 
protein gene; UTR, untranslated region. Top: expanded representation of the two ORFs (ORF1a and ORF1b) comprising the polycistronic Gene 1 and the likely cleavage 
products and cleavage sites after proteolytic processing of the 1a/1b polyprotein. Bottom: expanded representation of the 3’-end structural and accessory protein genes. 


for phylogenetic analysis of full-length genomes. The selected 
avian coronavirus reference strains and their accession numbers 
are provided in Table 2. Phylogenetic analysis and comparison 
of the 5’-UTR, Gene 1, S1, S2, Gene 3, M, Gene 5, N 
and 3’-UTR of the four IBV strains with those of the 
Mass-type strains selected in this study were conducted using the Clustal V method 
of DNAStar software and MEGA4 (Liu et al., 2009), and the alignments 
were edited manually to remove mistakes. 
Deletions, insertions and gene recombination were determined 
according to the results of the phylogenetic analysis and pairwise 
comparisons. 


2.7. GenBank accession numbers 


The full genomic sequences of the four Mass-type IBV strains 
described in this report have been deposited in the GenBank database 
under the following accession numbers: ck/CH/LNM/091017, JF330899; 
CK/CH/LHLJ/07VII, JF274479; ck/CH/LDL/101212, JF828981; and 
CK/CH/LHLJ/100902, JF828980. 


3. Results 


3.1. Complete genomic sequence and organization of the four Mass- 
type IBV strains 


Four Mass-type IBV strains were subjected to genome sequencing 
and phylogenetic analysis in this study. The sequences of each 
of the four strains were assembled into one contiguous sequence 
representing the entire viral genome. Sequences of 27630, 27678, 
27630 and 27473 nucleotides were obtained from strains 
ck/CH/LNM/091017, ck/CH/LHLJ/07VII, ck/CH/LDL/101212 and 
ck/CH/LHLJ/100902, respectively, excluding the polyadenylation tail at 
the 3’ end. The genomes of the viruses were similar overall in their 
coding capacity and genomic organization to those of other IBVs. 
The genome of each of the viruses contained two large slightly 
overlapping ORFs in the 5’ two-thirds of the genome and multiple 
additional ORFs in the 3’ one-third of the genome. Both termini 
were flanked with untranslated regions (UTRs). Ten ORFs were 
identified within the genome. Gene 1 contained motifs common 
to all coronaviruses, including ribosomal frameshifting and slippery 
sequences, because ORF1b is translated in the −1 frame. 
The typical coronavirus structural genes encoding the S, E, M and 
N proteins were identified following Gene 1 (Fig. 1). The genome 
organization was determined to be as follows: 5’-UTR-Gene 1 
(ORF1a, 1b)-S-Gene 3 (ORFs 3a, 3b, E)-M-Gene 5 (ORFs 5a, 5b)-N- 
UTR-3’. 


3.2. Strains ck/CH/LNM/091017 and ck/CH/LDL/101212 are closely 
related to the H120 vaccine 


The analysis of the complete genome showed that strains ck/CH/ 
LNM/091017 and ck/CH/LDL/101212 possessed 99.9% and 99.8% 
nucleotide identity with H120, respectively. However, they shared 
91.3% and 91.4% identity with M41, respectively. Phylogenetic anal- 
ysis using the full-length genome and the 5’-UTR, Gene 1, S1, S2, 
Gene 3, M, Gene 5, N and 3’-UTR showed that the IBV strains ck/ 
CH/LNM/091017 and ck/CH/LDL/101212 consistently formed the 
same clade with vaccine-related strains of Mass-type (Figs 2 and 
3). The analysis of the S1 gene showed that strains ck/CH/LNM/ 
091017 and ck/CH/LDL/101212 had high nucleotide identities 
(99.8% and 99.8%, respectively) with H120, while they had 97.7% 
and 97.7% identity with M41. Multiple alignments revealed that 
there were 2 and 1 nucleotide mutations within the S1 gene be- 
tween strains ck/CH/LNM/091017 and ck/CH/LDL/101212 and 
H120; however, there were 36 and 39 mutations between strains 
ck/CH/LNM/091017 and ck/CH/LDL/101212 and M41. All these re- 
sults suggest that strains ck/CH/LNM/091017 and ck/CH/LDL/ 
101212 are closely related to the H120 vaccine strain. 
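
The identity percentages and mutation counts reported above are simple functions of a pairwise alignment. The sketch below illustrates one way to compute them; it is not the MEGALIGN procedure used in the study, whose handling of gaps may differ. The function name `compare_aligned` and the toy sequences are hypothetical, and columns containing a gap are assumed to be excluded from the comparison.

```python
def compare_aligned(seq_a, seq_b):
    """Return (percent identity, substitution count) for two aligned sequences.

    Both sequences must come from the same alignment (equal length, '-' for gaps).
    Columns with a gap in either sequence are skipped (an assumption of this sketch).
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    substitutions = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a != b:
            substitutions += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    identity = 100.0 * (compared - substitutions) / compared
    return identity, substitutions


if __name__ == "__main__":
    # Toy fragments for illustration only; not real IBV sequence.
    isolate = "ATGTTGGTAA-ACCTCTT"
    reference = "ATGTTGGTGACACCTCTT"
    identity, substitutions = compare_aligned(isolate, reference)
    print(f"{identity:.1f}% identity, {substitutions} substitution(s)")
```

Applied to the S1 alignments, a function of this kind would yield figures comparable to the 1-2 differences from H120 and the 36-39 differences from M41 reported above, subject to the gap convention chosen.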


3.3. Strain ck/CH/LHLJ/100902 showed a close genetic relationship 
with pathogenic M41 


The percent nucleotide similarity between strain ck/CH/LHLJ/100902 
and H120 for the full-length genomes was 91.3%; however, 
the similarity between strain ck/CH/LHLJ/100902 and M41 
was as high as 99.7%. In addition, in all nine trees constructed 
for the 5’-UTR, Gene 1, S1, S2, Gene 3, M, Gene 5, N and 3’-UTR, 
strain ck/CH/LHLJ/100902 consistently fell into the same cluster 
as the pathogenic M41 strain, and both belonged to the Mass-type. 
Pairwise comparison of the S1 protein gene revealed that strain ck/ 
CH/LHLJ/100902 had 3 and 41 nucleotide mutations with respect 
to M41 and H120, respectively. Taken together, these results dem- 
onstrate that strain ck/CH/LHLJ/100902 exhibits a close genetic 
relationship to pathogenic M41. 
Fig. 2. Phylogenetic tree constructed on the basis of the full-length genomes of the IBV isolates and Turkey coronavirus (TCoV) reference strains using the neighbor-joining 
method. IBV strains ck/CH/LNM/091017, ck/CH/LHLJ/07VII, ck/CH/LDL/101212 and ck/CH/LHLJ/100902 are highlighted in bold. Numbers on the branches represent the 
percentage of 1000 bootstrap samples supporting the branch. 


Fig. 3. Neighbor-joining trees of the non-overlapping portions of the 5’-UTR, Gene 1, S1, S2, Gene 3, M, Gene 5, N and 3’-UTR of nine Massachusetts strains. Numbers on the 
branches represent the percentage of 1000 bootstrap samples supporting the branch; only values >50% are shown. 


3.4. Markers were found that distinguish between pathogenic and 
vaccinal Mass-type strains 


Pathogenic and non-pathogenic Mass-type strains were clustered 
into different clades by phylogenetic analysis of full-length 
genomic sequences and the 5’-UTR, Gene 1, S1, S2, Gene 3, M, Gene 
5, N and 3’-UTR. In addition, insertions and deletions were observed 
that distinguished between the genomes of pathogenic and 
non-pathogenic Mass-type strains, as illustrated in Fig. 4 and Supplementary 
material 1. In non-pathogenic strains, five deletions, of 
31, 5, 9, 10, and 9 nucleotides, respectively, were located in the nsp3 
region of Gene 1. They were found to occur between genomic positions 
2897-2929, 3219-3225, 3241-3251, 3257-3268, and 3398-3407, 
respectively, by comparing the sequences with the homologous regions 
of pathogenic strains. In contrast, a 3-nucleotide and a 9-nucleotide 
insertion were found in nsp6 and between the M gene 
and gene 5, respectively. Additionally, a cluster of insertions was 
found in the 3’-UTR of the non-pathogenic strains. These 
changes might not only account, at least partly, for viral fitness 
when the pathogenic virus has become adapted to embryonated eggs 
(Cavanagh et al., 2005; Hewson et al., 2012), but may also act as 
molecular markers, able to differentiate between vaccine and field 
strains, for diagnostic purposes. 
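
Insertion and deletion markers of this kind can be read directly from a pairwise alignment as runs of gap characters. The following is a minimal sketch under stated assumptions: the function names `gap_runs` and `indel_markers` are hypothetical, the toy sequences are not real IBV data, and positions are 0-based alignment columns rather than the genomic coordinates quoted above.

```python
import re


def gap_runs(aligned_seq):
    """Return (start_column, length) for every run of '-' in an aligned sequence."""
    return [(m.start(), m.end() - m.start()) for m in re.finditer(r"-+", aligned_seq)]


def indel_markers(field_aln, vaccine_aln):
    """Classify gap runs in a field-versus-vaccine pairwise alignment.

    A gap in the vaccine row is a deletion in the vaccine relative to the field
    strain; a gap in the field row is an insertion in the vaccine.
    """
    if len(field_aln) != len(vaccine_aln):
        raise ValueError("aligned sequences must have equal length")
    return {
        "deleted_in_vaccine": gap_runs(vaccine_aln),
        "inserted_in_vaccine": gap_runs(field_aln),
    }


if __name__ == "__main__":
    # Toy alignment for illustration only.
    field = "ACGTTGCAAGGCT---TACG"
    vaccine = "ACG-----AGGCTCCATACG"
    print(indel_markers(field, vaccine))
```

In a diagnostic setting, the presence or absence of such runs at known positions would be the quantity tested, for example by designing primers that flank the nsp3 deletions.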


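Fig. 4. Deletions or insertions in the IBV genomes of Mass-type strains. The sites at which the deletions or insertions are located are indicated by respective arrows and 
numbered. Gene 1 products labeled in the figure: nsp 1/2 (p87), 3 (PLpro), 4 (HD), 5 (3CLP), 6 (HD), 7, 8, 9, 10, 11, 12 (RdRp), 13 (Hel), 14 (ExoN), 15 (Nendo), 16 (MT). 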
3.5. Strain CK/CH/LHLJ/07VII is a mosaic 


Comparative sequence analysis based on full-length genomic sequences 
and the sequences of the 5’-UTR, Gene 1, S2, Gene 3, M, 
Gene 5 and N showed that strain CK/CH/LHLJ/07VII clustered with 
pathogenic Mass-type strains. The exceptions were the trees constructed 
using the 3’-UTR and S1 gene, in which CK/CH/LHLJ/07VII 
was grouped with non-pathogenic strains; this suggests that a 
recombination event may have occurred. Thus the N gene and 3’-UTR 
of CK/CH/LHLJ/07VII were carefully compared pairwise with 
those of strains ck/CH/LNM/091017, ck/CH/LDL/101212, 
H120 and M41. Consistent with the result of the phylogenetic 
analysis, ck/CH/LHLJ/07VII showed high similarity with M41 
at the 5’-end of the N gene; however, it showed high similarity with 
vaccine strain H120 at the 3’-end of the N gene (Supplementary 
material 2). The data strongly suggest that CK/CH/LHLJ/07VII arose 
from a homologous RNA recombination event that involved a template 
switch between Massachusetts pathogenic M41-like and 
non-pathogenic H120-like strains. We located the switch site at 
the 3’-end of the N gene (Supplementary material 2), which implies 
that the template switch occurred within the N gene. 
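
The logic of locating a switch site can be illustrated with informative sites, that is, alignment columns at which the two putative parents differ and the query matches exactly one of them. The sketch below is a simplified illustration, not the method used in this study, which relied on phylogenetic trees and pairwise comparison. The function names and toy sequences are hypothetical, and a single switch from parent A (M41-like) to parent B (H120-like) is assumed.

```python
def informative_sites(query, parent_a, parent_b):
    """Return (column, 'A' or 'B') where the parents differ and the query matches one."""
    sites = []
    for i, (q, a, b) in enumerate(zip(query, parent_a, parent_b)):
        if "-" in (q, a, b) or a == b:
            continue  # gapped or uninformative column
        if q == a:
            sites.append((i, "A"))
        elif q == b:
            sites.append((i, "B"))
    return sites


def best_breakpoint(sites):
    """Find the split that best fits 'A-like upstream, B-like downstream'.

    Returns (last A-side column, first B-side column, number of conflicting sites).
    Either column is None if the best split lies at an end of the site list.
    """
    labels = [label for _, label in sites]
    best_k, best_conflicts = 0, None
    for k in range(len(labels) + 1):
        conflicts = labels[:k].count("B") + labels[k:].count("A")
        if best_conflicts is None or conflicts < best_conflicts:
            best_k, best_conflicts = k, conflicts
    left = sites[best_k - 1][0] if best_k > 0 else None
    right = sites[best_k][0] if best_k < len(sites) else None
    return left, right, best_conflicts


if __name__ == "__main__":
    # Toy alignment for illustration only; not real IBV sequence.
    m41_like = "ACGTACGTACGTACGTACGT"
    h120_like = "ACTTACTTACTTACTTACTT"
    query = "ACGTACGTACGTACTTACTT"
    print(best_breakpoint(informative_sites(query, m41_like, h120_like)))
```

The switch is placed between the last column supporting parent A and the first supporting parent B; with real data, the width of that interval reflects how many informative sites lie near the 3’ end of the N gene.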


3.6. Strain CK/CH/LHLJ/07VII has experienced evolution over time 


The percent nucleotide similarity between strain CK/CH/LHLJ/07VII 
and H120, and between CK/CH/LHLJ/07VII and M41, for the full-length 
genomes was 92.5% and 99.4%, respectively. Alignment revealed 
that a 3-nucleotide insertion was located in nsp6 of the 
CK/CH/LHLJ/07VII strain between genomic positions 24009 and 24013 
(Supplementary material 3). In addition, the S1 gene of strain 
CK/CH/LHLJ/07VII showed extensive mutations by pairwise comparison 
(Supplementary material 4), although it was grouped with 
H120 by S1 gene phylogenetic analysis. These and our previous results 
(Liu et al., 2009) showed that, apart from the occurrence 
of recombination events, CK/CH/LHLJ/07VII has experienced 
multiple mutations and deletions in the genome over time. 


4. Discussion 


Understanding the evolution of Mass-type IBV is important, not 
only because this virus is circulating worldwide but also because information 
on virus genomics will aid our understanding of the evolution and 
emergence of IBV with infectious potential in vaccinated chicken 
flocks. In this study, we focused on the full-length genomic sequences 
of four IBV isolates which had been shown to be of the 
Mass-type by S1 gene analysis (Liu et al., 2009; Ma et al., 2012; 
Sun et al., 2011). Based on the high degree of similarity in the full 
genomic sequence, it could be concluded that two IBV strains, ck/ 
CH/LNM/091017 and ck/CH/LDL/101212, were very similar to the 
vaccine strain H120. They might therefore represent re-isolations 
of vaccine strains, although they were isolated from vaccinated 
chickens with respiratory disease. Similarly, IBV strains that 
showed a very close relationship to the H120 vaccine strain have 
been isolated from unvaccinated broiler flocks in Slovenia with 
respiratory problems (Krapez et al., 2010). Alternatively, these 
strains might be variants of vaccine strains that have resulted from 
accumulated point mutations after several passages in chickens. A 
few key mutations in the S1 subunit of the spike protein might re- 
sult in a change to a new serotype, which is defined as a lack of 
cross-neutralization with specific sera against different IBV sero- 
types (Cavanagh et al., 1992). The point mutations found in the 
genome that distinguish between the two isolates and vaccine 
strain H120 might be the result of adaptive evolution driven by 
the host immune response when the vaccine strain was transmitted 
among chickens. Adaptive evolution is the process by which 
genetic changes in the viral genome leading to a more fit virus population 
become fixed over time, and it has been reported to occur 
in many coronaviruses (Hasoksuz et al., 2007; Lee and Jackwood, 
2001; Shi et al., 2006; Tang et al., 2009; Zhang et al., 2006). 

As shown in this study, and as also occurs in other countries 
(Dolz et al., 2008; Rimondi et al., 2009; Roussan et al., 2009), the 
isolation of Mass-type IBV is expected, because attenuated vaccine 
strains are used extensively in chicken flocks in China. However, 
vaccination is not likely to be the only explanation for the circulation 
of Mass-type virus, because ck/CH/LHLJ/100902 and CK/CH/LHLJ/07VII 
were most closely related to a pathogenic Massachusetts-type 
strain, M41. The isolation of a Massachusetts pathogenic 
strain from H120-vaccinated chicken flocks may be due to vaccina- 
tion failure in these flocks (Han et al., 2011). Alternatively, molec- 
ular studies have shown that only a few changes in the amino acid 
composition of the S1 spike protein can result in immune failure, 
even when the majority of the virus genome remains unchanged 
(Cavanagh et al., 1992). Our findings showed that mutations had 
occurred in the genomes of both ck/CH/LHLJ/100902 and CK/CH/LHLJ/07VII, 
especially in the S1 gene of CK/CH/LHLJ/07VII, even though 
it grouped with the H120 strain in the phylogenetic 
analysis, implying that strain CK/CH/LHLJ/07VII has 
evolved over time. It has been reported that amino acid changes 
may result from immunological pressure caused by the wide- 
spread use of vaccines (Cavanagh et al., 1988, 2005). 

The occurrence of recombination events is another process that 
allows new strains to emerge, and this has been well documented 
in IBV (Hughes, 2011; Jia et al., 1995; Kottier et al., 1995; Kusters 
et al., 1990; Wang et al., 1993) and other coronaviruses (Fu and 
Baric, 1992, 1994; Makino et al., 1986). It is believed that the con- 
ditions for recombination amongst IBV strains in the field are as 
follows: an extremely large number of chickens, most maintained 
at high density; the ease of spread of the virus; and serotype co- 
circulation, including proof of co-infection with more than one 
serotype in a given flock (Cavanagh, 2007). In China, intensive 
chicken farms are concentrated in many provinces, including 
Heilongjiang, where ck/CH/LHLJ/07VII was isolated (Liu et al., 
2009). Almost all the chickens in China receive Mass-type vaccines 
at a very young age and subsequently receive this vaccine a couple 
more times during the rest of their life span. Therefore the vaccine 
virus exists constantly in chickens; it may persist in various internal 
organs for 163 days or longer (Cavanagh and Gelb, 2008). Generally, 
vaccination using the H120 vaccine provides full protection 
against pathogenic Mass-type IBVs and prevents the 
same type of pathogenic strain from replicating and spreading 
in the flocks. However, a single amino acid substitution at position 
63 of the S1 subunit of the spike has resulted in escape 
mutants of Mass 41 (Cavanagh et al., 1988). This may have 
occurred in the case of the ck/CH/LHLJ/07VII S1 gene (Liu et al., 
2009), which might have made it possible for both pathogenic 
and vaccine strains to co-exist in a given flock, leading to the 
occurrence of recombination. Similarly, an escape mutant could 
be a result of adaptive evolution driven by the host immune 
response. Consequently, it is likely that genetic changes due to 
adaptive evolution and recombination both contributed to the origin 
and evolution of strain ck/CH/LHLJ/07VII: it is possible that 
adaptive evolution created a mutant, followed by recombination 
between Mass 41- and H120-like strains to create a novel virus. 

The recombination events from which the CK/CH/LHLJ/07VII 
virus resulted can be explained by a scenario in which the recombi- 
nation may have involved two parental viral strains, with initiation 
of RNA replication in a M41-like template of either negative or po- 
sitive polarity (Liao and Lai, 1992). This would be followed by 
switching of the polymerase-nascent cRNA complex to an H120- 
like virus template. The switch may have occurred at the 3’-end 
of the N gene. In general, for a recombinant virus to emerge and 
establish itself in the field, it must be viable and have selective 
advantages. It has been reported that uptake of canine coronavirus 
(CCV) sequences by type II feline coronavirus (FCoV) may have led 
to increased viral fitness when compared with type I FCoV 
(Herrewegh et al., 1998). Recombination can also result in the 
emergence of new strains with distinct characteristics, such as 
pathogenicity and tissue tropism (Worobey and Holmes, 1999). In 
addition, in the CoV genome, as with most RNA viruses, the 3’-UTRs 
usually harbor important structural elements that are involved in 
replication and/or translation (Chang et al., 1994; Raman et al., 
2003; Raman and Brian, 2005; Goebel et al., 2006; Züst et al., 
2008). In IBV, the 3’-UTR binds to the N protein, which is essential 
for the synthesis of negative-strand viral RNA. Perhaps the acquisition 
of the 3’ end of the N gene and the 3’-UTR from an H120-like virus 
by an M41-like virus (e.g. CK/CH/LHLJ/07VII) can alter the efficiency 
of viral replication. This alteration may in turn affect pathogenicity. 
However, it remains unknown whether this is the true origin of 
CK/CH/LHLJ/07VII, and therefore this strain is of particular importance 
to the surveillance of IBV in China. It will be of equal importance to 
examine future outbreaks of IBV in chickens by full-length genomic 
sequence analysis in the context of novel recombination events 
among IBV strains. Furthermore, investigations using reverse ge- 
netic systems might provide further insight into these issues and 
increase our understanding of IBV pathogenesis. 